Sentence Correction Based on Large-scale Language Modelling

Author

  • Ji Wen
Abstract

As informatization advances, more and more data is stored as text. Some of this text is lost during generation and transmission. This paper aims to build a language model from a large-scale corpus to restore the missing text. We introduce a novel measure for locating missing words and a way of building a comprehensive candidate lexicon from which the correct word is chosen for insertion. We also introduce several effective optimizations that greatly improve the efficiency of text restoration, reducing the time needed to process 1000 sentences to 3.6 seconds.
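
The abstract does not say which model or measure the paper uses, so the following is only a rough illustration of the general idea of restoring a missing word with a language model. It assumes an add-one-smoothed bigram model, a toy three-sentence corpus, and exactly one missing word per sentence; the names `train_bigram`, `log_prob`, and `restore` are hypothetical and do not come from the paper.

```python
from collections import Counter
import math

def train_bigram(corpus):
    """Count unigrams and bigrams over sentences padded with boundary markers."""
    unigrams, bigrams = Counter(), Counter()
    for sentence in corpus:
        tokens = ["<s>"] + sentence.split() + ["</s>"]
        unigrams.update(tokens)
        bigrams.update(zip(tokens, tokens[1:]))
    return unigrams, bigrams

def log_prob(tokens, unigrams, bigrams):
    """Add-one-smoothed bigram log-probability of a token sequence."""
    vocab_size = len(unigrams)
    total = 0.0
    for prev, cur in zip(tokens, tokens[1:]):
        total += math.log((bigrams[(prev, cur)] + 1) / (unigrams[prev] + vocab_size))
    return total

def restore(sentence, unigrams, bigrams):
    """Try every (gap position, candidate word) pair; keep the most probable sentence.

    Assumption: exactly one word is missing. The candidate lexicon here is simply
    the whole training vocabulary, not the paper's candidate lexicon.
    """
    words = sentence.split()
    candidates = [w for w in unigrams if w not in ("<s>", "</s>")]
    best_score, best_words = -math.inf, words
    for i in range(len(words) + 1):
        for cand in candidates:
            trial = words[:i] + [cand] + words[i:]
            score = log_prob(["<s>"] + trial + ["</s>"], unigrams, bigrams)
            if score > best_score:
                best_score, best_words = score, trial
    return " ".join(best_words)

# Toy corpus standing in for the large-scale corpus mentioned in the abstract.
corpus = [
    "the cat sat on the mat",
    "the dog sat on the rug",
    "a cat sat on a mat",
]
unigrams, bigrams = train_bigram(corpus)
print(restore("the cat on the mat", unigrams, bigrams))  # the cat sat on the mat
```

This brute-force search costs one model evaluation per position per vocabulary word, which is the kind of cost that a restricted candidate lexicon and the optimizations mentioned in the abstract would be expected to reduce.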


Similar resources

The Role of Emotioncy in Cognitive Load and Sentence Comprehension of Language Learners

Emotion and cognition are both considered influential factors in language learning. In this study, the role of "emotioncy" (which is a combination of emotion and frequency) in the cognitive load and sentence comprehension of a group of language learners was examined. Emotioncy includes emotions that are evoked by the senses. To this aim, 200 English as a foreign language (EFL) learners were ask...


Speech and Language Resources for LVCSR of Russian

A syllable-based language model reduces the lexicon size by hundreds of times. It is especially beneficial in case of highly inflective languages like Russian due to the abundance of word forms according to various grammatical categories. However, the main arising challenge is the concatenation of recognised syllables into the originally spoken sentence or phrase, particularly in the presence o...


3D Modelling of Under Ground Burried Objects Based on Ground Penetration Radar

There is a growing demand for mapping and 3D modelling of buried objects such as pipelines, agricultural heritage, landmines and other buried objects. Usually, large-scale and high-resolution maps of these objects are needed. Manual map generation and modelling of these objects is costly and time-consuming and depends on many resources. Therefore, automating the subsurface mapping and...


The Effect of Sentence-Writing Practice on Iranian low-intermediate EFL Learners’ L2 Grammatical Accuracy

This study aimed to investigate the effect of sentence-writing practice on male and female low-intermediate students’ English grammatical accuracy. The question this study tried to answer was whether English grammatical accuracy can be affected by sentence-writing practice. To answer this question, 15 low-intermediate students from Kish away institute were selected. They were both mal...


Spelling Correction Using Context

This paper describes a spelling correction system that functions as part of an intelligent tutor that carries on a natural language dialogue with its users. The process that searches the lexicon is adaptive as is the system filter, to speed up the process. The basis of our approach is the interaction between the parser and the spelling corrector. Alternative correction targets are fed back to t...



Journal title:
  • CoRR

Volume abs/1709.07777  Issue 

Pages  -

Publication date 2017